Corpus: epo_newscrawl_2017_100K

5.1.18 Words nearly always as next neighbors

Strong next-neighbor (NN) co-occurrences with a low probability of being separated

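The quotient below is calculated as freq(word1)*freq(word2)/NN_freq^2. A quotient of 1.00 means that both words occur only as this adjacent pair; larger values mean that at least one of them also occurs elsewhere.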
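As a minimal sketch of this calculation, assuming the quotient is the product of the two word frequencies divided by the squared next-neighbor frequency (which is what the table values match), the following Python function reproduces two rows. The function name `nn_quotient` and its parameter names are hypothetical and chosen only for illustration.

```python
def nn_quotient(freq_word1: int, freq_word2: int, nn_freq: int) -> float:
    """Return freq(word1) * freq(word2) / NN_freq^2.

    Assumption: nn_freq counts how often word1 is directly followed
    by word2, so it cannot exceed either single-word frequency.
    """
    if nn_freq <= 0:
        raise ValueError("nn_freq must be positive")
    return freq_word1 * freq_word2 / nn_freq ** 2


# Values taken from the table rows "resendas ĉi-tien" and "legala pagilo".
print(f"{nn_quotient(45, 36, 34):.2f}")  # prints 1.40
print(f"{nn_quotient(8, 8, 8):.2f}")     # prints 1.00
```

The closer the result is to 1.00, the less often the two words appear without each other.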

Word 1       Word 2        Frequency of word 1  Frequency of word 2  Frequency as NN  Quotient
resendas     ĉi-tien       45                   36                   34               1.40
Abi          Talib         27                   27                   26               1.08
Orienthinda  Kompanio      14                   16                   13               1.33
Notre        Dame          10                   10                   9                1.23
Sarge        Baldy         8                    9                    8                1.13
legala       pagilo        8                    8                    8                1.00
Bocan        Raton         5                    7                    5                1.40
Be'er        Ŝeba          8                    7                    7                1.14
Abol-Gasem   Ferdoŭsio     6                    6                    6                1.00
Soka         Gakkai        5                    6                    5                1.20
Holivuda     Raportisto    5                    6                    5                1.20
Montara      Karabaĥo      7                    5                    5                1.40
Krea         Komunaĵo      6                    5                    5                1.20
Nyota        Uhura         5                    5                    5                1.00
linuksaj     distribuaĵoj  4                    5                    4                1.25
vidbendaj    kameraoj      4                    5                    4                1.25
Gliese       581           4                    4                    4                1.00
Indho        Ade           4                    4                    4                1.00
Palo         Alto          4                    4                    4                1.00
Frequently   Asked         3                    4                    3                1.33
189 msec needed at 2018-02-26 22:43